Using Protein Clusters from Whole Proteomes to Construct and Augment a Dendrogram

Authors

  • Yunyun Zhou
  • Douglas R. Call
  • Shira L. Broschat
Abstract

In this paper we present a new ab initio approach for constructing an unrooted dendrogram using protein clusters, an approach that has the potential for estimating relationships among several thousand species based on their putative proteomes. We employ an open-source software program called pClust that was developed for use in metagenomic studies. Sequence alignment is performed by pClust using the Smith-Waterman algorithm, which is known to give an optimal local alignment and, hence, greater accuracy than BLAST-based methods. Protein clusters generated by pClust are used to create a protein profile for each species in the dendrogram, and these profiles form a correlation filter library for use with a new taxon. To augment the dendrogram with a new taxon, a protein profile for the taxon is created using BLASTp, and the new taxon is placed at the position within the dendrogram corresponding to the highest correlation with profiles in the correlation filter library. This work was initiated because of our interest in plasmids, and each step is illustrated using proteomes from Gram-negative bacterial plasmids. Proteomes for 527 plasmids were used to generate the dendrogram, and, to demonstrate the utility of the insertion algorithm, twelve recently sequenced pAKD plasmids were used to augment the dendrogram.
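The placement step described in the abstract can be illustrated with a minimal sketch. It assumes, purely for illustration, that each protein profile is a binary presence/absence vector over protein clusters and that "correlation" means the Pearson coefficient; the abstract specifies neither. The names `pearson`, `best_match`, `plasmid_A` and so on are hypothetical, and the toy data are not taken from the paper.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length numeric vectors."""
    if len(x) != len(y) or not x:
        raise ValueError("profiles must be non-empty and of equal length")
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        # A constant profile carries no information; treat as uncorrelated.
        return 0.0
    return cov / sqrt(var_x * var_y)


def best_match(library, new_profile):
    """Return (name, score) of the library profile most correlated with new_profile."""
    scores = {name: pearson(profile, new_profile)
              for name, profile in library.items()}
    best = max(scores, key=scores.get)
    return best, scores[best]


# Hypothetical library: one presence/absence vector per taxon already in
# the dendrogram, with one entry per protein cluster (assumed encoding).
library = {
    "plasmid_A": [1, 1, 0, 0, 1],
    "plasmid_B": [0, 1, 1, 1, 0],
    "plasmid_C": [1, 0, 0, 1, 1],
}

# Hypothetical profile for a new taxon over the same five clusters.
new_taxon = [1, 1, 0, 0, 0]

name, score = best_match(library, new_taxon)
print(name, round(score, 3))
```

In this toy setup the new taxon would be attached next to the library entry with the highest score. How the paper actually builds profiles from BLASTp hits and inserts the taxon into the tree is not stated in the abstract.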


Similar resources

Whole-Proteome Analysis of Twelve Species of Alphaproteobacteria Links Four Pathogens

Thousands of whole-genome and whole-proteome sequences have been made available through advances in sequencing technology, and sequences of millions more organisms will become available in the coming years. This wealth of genetic information will provide numerous opportunities to enhance our understanding of these organisms including a greater understanding of relationships among species. Resea...


Genetic variation of some Iranian Hyoscyamus Landraces based on seed storage protein

The genus Hyoscyamus belongs to the tribe Hyoscyameae Miers of the Solanaceae family. Variation in protein bands elaborates the relationship among the collections from various geographical regions. In this study the seed storage protein diversity of 19 accessions of Hyoscyamus (H. niger, H. reticulatus and H. pusillus) from West Azerbaijan (Iran) was investigate...


Morphological, Molecular and Phytochemical Variation in Some Thyme Genotypes

Thyme is an important medicinal plant in the cosmetic, pharmaceutical and food industries. The first step in breeding thyme is evaluating the genetic variation and relationships among thyme accessions. Therefore, the objective of this study was to evaluate the morphological, chemical and molecular variation of 13 accessions of the medicinal plant thyme. ANOVA showed significant differences between acces...


IRAP and REMAP based genetic diversity among varieties of Lallemantia iberica

This study describes the genetic relationships among 34 varieties of Lallemantia iberica using inter-retrotransposon amplified polymorphism (IRAP) and retrotransposon-microsatellite amplified polymorphism (REMAP). Samples were collected from Agriculture Research Center of Urmia city (northwest Iran). Ten IRAP and REMAP primers generated 76 scorable electrophoretic bands with 78.94% pol...


A Visualization Approach to Automatic Text Documents Categorization Based on HAC

The ability to visualize documents as clusters is essential. Even the best data summarization technique will be misleading if its results are poorly represented or visualized. As proposed in many studies, clustering techniques are applied and the results are produced when documents are grouped into clusters. However, in some cases, the user may want to know t...



Journal title:

Volume 2013, Issue

Pages -

Publication date: 2013